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Abstract 

We consider the problem of testing small set expansion in general graphs. A graph G is a (fc, <fi)- 
expander if every subset of volume at most k has conductance at least <f>. Small set expansion has recently 
received significant attention due to its close connection to the unique games conjecture and the local 
graph partitioning algorithms. 

We give a one-sided error tester that takes as input an n-vertex graph G, a volume bound k, an 
expansion bound <fi and a distance parameter e > 0. The tester accepts the graph if it is a (k, </>)-expander 
and rejects it with high probability if it is e-far from any (k*, </>*)-expander, where k* = 0(fc 1_ £) and 
4>* = 0(£0 2 ) for any < £ < 1/2. Furthermore, whenever the tester rejects a graph it outputs a 
certificate to the fact that G is not a (k, 0)-expander in form of a subset with volume at most k and 
conductance less than <f). The running time of the tester is 0{ ^fi ), which is sub linear in n for k — 
(9( log i"i) - ). Our methods combine the newly developed analysis techniques of lazy random walks for 
local graph partitioning algorithms and techniques for testing expansion in general graphs. 

1 Introduction 

Graph property testing is one of the most effective algorithmic paradigms to process real-world networks, 
the scale of which has become so large that it is even impractical to read the whole input. In the setting of 
testing a graph property P, we are given as input a graph G and we want to design an algorithm (called 
tester) to distinguish the case that G has property P from the case that G is "far from" the property P. 
Here, the notion of being "far from" is parameterized by a distance parameter e. In most situations, a 
graph G is said to be e-far from property P if one has to modify at least an e fraction of edges of G to 
obtain a graph G' with property P. Typically, given oracle access to the input graph G, we want to design 
a property tester that only queries a small portion of G, which often leads to sublinear time algorithms. 

In the last fifteen years, many testers have been developed for different graph properties, such as k- 
colorability, bipartiteness, acyclicity, triangle-freeness and many others (see the survey [Goll0|). Most 
of these testers apply only to the adjacency matrix model or the adjacency list model, depending on the 
types of queries the testers are allowed to perform to the oracle. The former model is most suitable for 
dense graphs and general characterizations on the testability of a property in this model has been given 
(e.g., | AFNS09I). The latter model is most suitable for sparse graphs, and several property testers using 
the techniques of local search or random walks are known, while it is not well understood what properties 
are testable in constant in this model. Even less is known about testers or testability results for general 
graphs. Existing property testing algorithms in this model usually resort to reductions that substitute high 
degree vertices with expanders or transform the graph into a bound degree graph, on which the testers 
are already known or can be easily designed (e.g., [KKR04 LPPlQ). 
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In this paper, we consider the problem of testing small set expansion and give a tester using recent 
new analysis techniques of lazy random walks in the design of local graph partitioning algorithms and 
do not reduce our problem to other classes of graphs. Given a graph G = (V, E) with n vertices 
and m edges, and a set S C V, let the volume of S be the sum of degree of vertices in S, that is, 
vol(5) := X^es^( v )> where d(v) denotes the degree of vertex v. Define the conductance of S as 
4>(S) :— ypf('g) , where e(S, S) is the number of edges leaving S; and define the expansion profile of 
G as 4>(k) := mms: V0 i(S)<k 4>(S). A graph G is called a (k, </>)-expander if <j>(k) > <j>, that is, all the 
subsets in G with volume at most k have conductance at least cj>. We will refer to small set expander as 
(k, </>)-expander and refer to small set expansion as <f>(k) for k much smaller than vol(V)/2 = rn. 

Besides of the relation to the mixing time of random walks [LK99], small set expansion has been of 
much interest recently for its close connection to the unique games conjecture I1RS10IIABS10I . and the 
design of local graph partitioning algorithms in massive graphs [ST08 ACL06 AP091 IOT12I IKL121 . 
Approximation algorithms and spectral characterizations for the small set expansion problem have been 
studied HABS10I ILRTV12I ILOT12I IKLTH IDtTH lOWTH . It is natural to ask if one can efficiently (in 
sublinear time) test if a graph is a small set expander. 

We give a tester for small set expansion in a model for the general graphs. The model assumes 
access to the graph by an oracle, which allows both degree queries and neighbor queries that query the 
degree and the i-th neighbor of some vertex, respectively. We use the common definition of distance 
between graphs. More precisely, a graph G with m edges is said to be e-far from a (k, </>)-expander if 
one has to modify at least em edges of G so that it becomes a (k, </))-expander. We will assume that e is 
a sufficiently small constant. We present our small set expansion testing algorithm called SSETester 
in SectionfJ] Our main result of the tester is the following theorem. 

Theorem 1 (Main). Given as input an n-vertex graph, a volume bound k, a conductance bound <f>, and 
a distance parameter e, the algorithm SSETester accepts any graph that is a (k, <j))-expander, and 
rejects with high probability any graph that is e-far from any [k* , (f>*)-expander, where k* — 0(/c 1- ^) 
and 4>* — O(£0 2 ) for any < £ < 1/2. Furthermore, whenever it rejects a graph, it provides a 
certificate that the graph is not a (k, <j))-expander in the form of a set of volume at most k and expansion 
at most 4>. The running time of the algorithm is 0( ^ ). 

Note that our testing algorithm has one-sided error, which means that the tester always accepts every 
(k, </>) -expander, and it runs in time sublinear in n for k — 0{ lo n"i) ~ )■ 

1.1 Our Techniques 

As aforementioned, we will adopt recent new analysis techniques for lazy random walks in the design of 
local graph partitioning algorithms instead of trying to reduce the problem to testing small set expansion 
in other classes of graphs. We first perform multiple random walks on the input graph G from some 
randomly chosen vertex v and show that we can get empirical distributions that approximate well the 
true probability distributions of lazy random walks of length t. Then by using similar arguments for 
local graph partitioning algorithms, we can prove that if the start vertex v is "weak" in some sense, then 
a sweep process over the empirical distributions can find a small set with small expansion. Finally, we 
show that if the input graph is far from any small set expander, then there are at least a constant fraction 
(measured in volume) of weak vertices, so that with high probability, we will sample out a weak vertex 
and thus find a small non-expanding set. In the proof of the final part, we show that if there are few weak 
vertices, then we can modify at most em edges of G to get a small set expander by a known patch-up 
algorithm, which has been used in the analysis of the expansion testers. 

'"with high probability" means with probability at least 2/3. 



2 



1.2 Related Works 



There is an interesting line of research on testing the special case of the (k, (f>) -expander for k = m, which 
is often abbreviated as <p-expander. (And cf>(m) is often abbreviated as the expansion (or conductance) of 
G |HLW06 |.) Goldreich and Ron [GR02| have proposed an expansion tester for bounded degree graphs. 
The tester (with different setting parameters) has later been analyzed by Czumaj and Sohler DCS 071 , 
Nachmias and Shapira |NS10|, and Kale and Seshadhri IIKS1 lbl . and it is proven that the tester can 
distinguish d-regular ^-expanders from graphs that are e-far from any e?-regular 51 ((f) 2 ) -expanders. The 
running time of the tester is O(n - 5+v (e^ 1 logn)^ 1 )) for any r\ > 0, which is almost optimal. Li, Pan 
and Peng BLPP1 11 give an expansion tester for general graphs that matches the best known tester for 
bounded degree graphs. We note that our tester is not a sublinear expansion tester for the general graphs, 
since our algorithm try to find a set with small conductance and it will take time at least proportional to 
the volume of output set. Especially, when the volume parameter k = m, it runs in time Oim log 6 m). 

The techniques of random walks have also been used to test bipartiteness under different mod- 
els BKKR04I ICMOS 1 Tl . The idea of using multiple random walks on graphs to estimate the true prob- 
ability distribution of random walks of length t and then to find sparse cuts has been used by Kale 
and Seshadhri HKS1 lal and Das Sarma, Gollapudi and Panigrahy [DGP09 1 in the design of local graph 
algorithms and graph stream models, respectively. 

1.3 Organization 

The rest of the paper is organized as follows. In Section|2]we give some basic definitions and introduce 
the tool of random walks. Then we present our tester and bound its running time in SectionfJ] We give 
the proof of the soundness of the tester and thus complete the proof of the main theorem in Section|4] In 
Section|5] we discuss some concluding remarks and open problems. 

2 Preliminaries 

Let G = (V, E) be an undirected graph with n vertices and m vertices. Let d(v) denote the degree of 
a vertex v. We assume that we can access to the graph G by an oracle which allows both the degree 
queries and neighbor queries, that is, for any vertex v, one can query in constant time the degree of v 
and the z-th neighbor of v. 

For a vertex subset S C V, let 6(5*, V\S) be the number of edges leaving S. Let vol(S) := 
X^gs d(v) and 4>(S) := e(S, S)/vol(S) be the volume of S and the conductance of S, respectively. 
Note that vol(G) := vol(V) = 2m. Define the expansion profile of G as <j>(k) := mixis :vo us)<k 4>(S). 
In particular, (j)(m) is often referred to the conductance (or expansion) ofG. 

Definition 1. A graph G is a (fc, <j))-expander if (j)(k) > <f>. Equivalently, G is a (k, (p)-expander if for 
every S C V with volume vol(S) < k has conductance cj>(S) > 4>. 

We have the following definition of graphs that are e-far from (k, 0)-expanders. 

Definition 2. A graph G is e-far from a (k, <j))-expander if one has to modify at least em edges of G to 
obtain a (k, 4>)- expander. 

The main tool in our algorithm and analysis is the lazy random walk on G. In a lazy random walk, 
if we are currently at vertex v, then in the next step, we choose a random neighbor u with probability 
l/2d(v) and move to u. With the remaining probability 1/2, we stay at v. Let A, D denote the adjacency 
matrix and diagonal degree matrix of G, respectively. Then W := (I+D^ 1 A) /2 is probability transition 
matrix of the lazy random walk of G, where / is the identity matrix. 

We will use bold letters to denote row vectors. Let p be an initial probability distribution on vertices. 
The probability distribution after t steps of lazy random walks is defined as p t = PgVK*. Let l v denote 
the indicator vector on vertex v. For any vector p, let p(5) := X^es P( w )- Recently, Oveis Gharan and 
Trevisan proved the following lemma to bound the escaping probability of random walks. 



3 



Lemma 1 (|0T12|). For any set S C V and integer t > 0, there exists S t C S such that vol(S l ) > 
vol(S)/2andforanyv € S t ,p t '.— lyW 1 , we have p t (S) > Ci(l — ^ip-) 1 for some constant ci > 0. 

Lovasz and Simonovits [LS90 LS93 1 introduced the following potential function to analyze random 
walks, which has been widely used to design local graph partitioning algorithms I ST08. IACL061 [ AP09 , 
IOTT2llKLT2l . 

For any vector p and < x < 2m, let 7(p, x) := max w6 x> x Y^vpv P( v ) yv ( v )^ where T> x = {w|w E 
[0, 1]", Yf(v)d(v) = x}. Note that if we order the vertices so that > ■ ■ ■ > §fe4, and let 

Si(p) := {ui, • • • , Vi] denote the sweep set of the first i vertices and let Xi :— vol(5j), then 7(p, Xi) = 
J2veSi(p)P( v ) = P(^(p))- P° r otner x values such that ajj_i < x < Xi, I(p,x) — 7(p, Xi-i) + 
(P) — 7(p, Xi-i)), that is, 7(p, x) is a piecewise linear function of x. Furthermore, 7(p, x) is 
a non-decreasing and concave function in x. Lovasz and Simonovits proved the following lemma on the 
potential function. 

Lemma 2 (|LS90, LS93Q. For any probability distribution p and any vertex set S with volume at most 
m,pW{S) < i(I(p,vol(S)(l + 4>(S)))+I(p,vol(S)(l - 0(5)))). 

3 The Small Set Expansion Tester 

In this section, we present our property testing algorithm for small set expansion and provide the proof 
of the easy part of the main theorem. 

SSETester(G,fc,0, e) 

1. Repeat M = 6(l/e) times: 

(a) Pick a random vertex v with probability proportional to its degree. 

(b) If Detector(G, v, k, 4>) returns found then abort and output reject. 

2. In case no call to Detector returned found then output accept. 

The subroutine Detector is defined as follows. 

Detector(G, v,k,<j>) 

1. Perform N lazy random walk of length T from v, where N := 30T 2 fclnfc and T := 
8lnk/(f> 2 . 

2. For each length i = 0, 1,2, ...,T: 

(a) Let AT (it) denote the number of walks of length t that ends at vertex it. Order the 
vertices in decreasing order of the ratio of N(u)/d(u), breaking ties arbitrarily. 

(b) Let Si be the first i vertices in this ordering for any 1 < i < n. Compute the 
conductance of each set Si with volume at most k. 

(c) If the conductance of any such set is less than (j>, then return found. 

3. Return not-found. 



We want to show that SSETester has the performance guarantees as given in Theorem[T] The run- 
ning time of the tester can be bounded as follows. In the subroutine Detect or, for each t < T,the com- 
putation involves sorting vertices according to the ratio N(u) /d(u), which takes time 0(N log N) (since 
there are at most N vertices with nonzero p t values, we only need to sort these N vertices and order other 
vertices arbitrarily); and computing the conductance for each small sweep sets, which takes time 0(k), 
respectively. Therefore, each call to Detector takes time 0(T(N\ogN + k)) = 0(/cln 5 k/4> & ). The 
total time of the tester involves M calls to Detector and thus is 0( kl e 1 l 6 k ). 
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Now note that if G is a (fc, if) -expander, then there is no subset with volume at most fc and con- 
ductance less than <fi, and thus Detector will return not-found and SSETester will output accept. 
Therefore, to prove Theorem Q] it remains to prove the soundness of the tester, that is, to show that if 
G is e-far from any (fc*, </>*) -expander, then SSETester rejects G with probability at least 2/3, where 
fc* = 0(k 1 ^^) and </>* = 0(£</> 2 ) for any < £ < 1/2. In the remaining of the paper, we will assume £ 
to be some fixed number less than 1/2. 



4 Proof of the Soundness of the Tester 

Before we give the details of the proof of the remaining part of Theorem Q] we give a high level 
exposition of the ideas. Recent progress on local graph partitioning algorithms given by Oveis Gharan 
and Trevisan HOT 121 (and independently Kwok and Lau[KL12|) shows that if the graph contains a subset 

5 with volume at most k and conductance at most <fi, then by choosing "proper" start vertex v £ S and 
computing the conductance of all the sweep sets of p t — \ v W f with volume at most 0(k 1+ ^ ) for any 
< £' < 1/2 and t < T, we are guaranteed that at least one such sweep set has conductance 0(<J </>/£')> 
where T is a properly chosen parameter. 

The above idea is used in the design and analysis of our tester. We show that if the input graph G 
is e-far from any (fc*, </>*) -expander, then there exists "many" such "proper" start vertices (called weak 
vertices) from which the random walk probability distributions can be used to find small non-expanding 
sets. The existence of "many" weak vertices ensures that we can sample out at least one weak vertex with 
high probability. We prove this fact by contradiction and show that if there are "few" weak vertices in G, 
then there exists a patch-up procedure that changes at most em edges of G to get a (A;*, 0*) -expander. The 
proof of this part is almost the same as the corresponding part in the expansion tester BKS1 lbl ILPP1 II 
and is given in Section FOl 

One important caution here is that in order to design a sublinear time tester, we are not able to 
compute the exact probability distribution p t . Instead we perform multiple lazy random walks on the 
graph to obtain empirical distributions p t , which can be computed by inspecting only a small portion of 
the graph. For any vertex u, p t (u) is the fraction of lazy random walks of length t that hit on u. We 
show that p t is a good estimation of p t , and the sweep sets of p t can also be used as candidate sets to 
find small sets of small expansion. 

In the remaining of this section, we first prove that the empirical distribution p t is a good approxi- 
mation to p t for any t < T, and then give the definition of weak vertices for which Detector returns 
found. After that we show that if G is e-far from any (fc*, 0*) -expander, then the set of weak vertices 
has volume at least O(em), which is large enough to ensure our tester sampling out at least one weak 
vertex and thus rejecting the graph. 



4.1 Analysis of the Random Walks in Detector 

Assume that the random walks in Detector start from some vertex v. For each t < T = 81nfc/<^> 2 , 
define the empirical distribution in the Detector as p t (u) := N(u) /N, where N(u) is the number of 
walks of length t that end at u and N = 30T 2 fc In fc is the total number of random walks. Note that Si 
in step dZbl of the subroutine Detector is the sweep set of the first i vertices of p 4 . We first show that 
p t is a good approximation to the exact probability distribution p t = l v W l . 

Lemma 3. For any vertex u and t < T, with probability at least 1 — fc~ 10 , 

\p t (u) - Pt {u)\ < X u := ^(p t (u) + fc" 1 ). (1) 
Furthermore, for any x < fc, with probability at least 1 — fc -8 , 

(1 - ^)I(Pt,x) ~^f< UPt, x)<(l + ^)I(p t ,x) + (2) 
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Proof. By the fact that E[p t (u)] = p t (u) and the Bernstein's inequality, we have that 

, NX 2 
Pr[|p t (u)-p t (tt)| < A„] < 2exp - 



2(p t (u) + A M /3)> 

~ 2eXP l 3( Pt (u) + k-i) )^ 1/k 

Now let £ denote the event that inequality (Q~|) holds for all vertices in the sweep sets Sj (p t ) and 5; (p t ) 
of volume at most k and we will condition event £, which occurs with probability at least 1 — k~ a . Let 
Xi = vol(5j(p t )) and Xi — vol(5j(p t )). Then by the definition and properties of the potential function, 
we have that 

I(p t)Xi ) > p t (Si(p t )) > (1 - i)R(Si(p t )) - ^ > (1 - f)I(j> t ,Xi) - £ 

and that 

I(p t ,Xi) > p t (5<(p t )) > T -l^(p t (^(p ( )) _ JL) > _L_(/(p tj£i ) _ 

Finally inequality (fJJ follows immediately from the fact that both J(p t , x) and 7(p t , x) are piecewise 
linear, concave and non-decreasing. □ 

Let p t and p t be as defined above. We now give an upper bound on 7(p t , x) when all the small sweep 
sets are expanding. 

Lemma 4. If for all t < T, the sweep sets Si(p t ) of volume at most k has conductance at least <j), then 
with probability at least 1 — fc~ 8 , for any x < 2m, 

I(p t ,x) < e^ T (l + - f )*) +3e 4t /^. (3) 

Proof. Assume the event £ in the proof of Lemma|3]holds, which occurs with probability at least 1 — fc~ 8 . 
Thus inequality (O also holds. Let ii = vol(5j(p ( )) < k. Since Si(p t ) has conductance at least </>, 

I(p t ,Xi)=p t (Si(p t )) < (1 + ~)p t ($(Pi)) + ^ 

< ^-^(/(P t -i,£i(i + 0)) + J(Pt-i,5*(i - <j>))) + ^ 

1 4- 1 IT 2t t- 

< 2(1 _ 1/T) ( f (Pt-i.£«(l + *)) + ^(Pt-i.2i(l ~ 0)) + ^) + ^ 

e 3 / T 3i 

< __(J(p t _ 1 ,x i (l + cj>))+ 7(p t _ llXi (l-0))) + -^, (4) 

where the first inequality follows our assumption, the second inequality follows from Lemma|2] the last 
two inequalities follow from inequality E] and the fact that pzjy^ < min{2, e 3 / T }, respectively. 

Now we show by induction on t that the statement of lemma holds. For t = 0, the inequality <£3j 
holds trivially since the LHS is at most min{l, x/d(v)} and the RHS is at least y/x/d{v). Now assume 
the lemma holds for t — 1. Since J(p t , x) is a piecewise linear function of x, and the RHS is a concave 
function, it is suffices to prove the statement for values of ii = vol(5j(p t )). If Xj > k, the statement 
holds since the RHS is at least 1, which is an upper bound for the LHS. Now assume Xi < k. By 
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inequality ©, we have 

e 3/T 3x 



< 



' ' r #-w(*£ + v^)(i - £ )" 1 (VTT0 + «) 



+6e 4(t_1)/T — 



fcT 



< e 3t / T (| + v ^R(l-y) )+(3 e 3/T e 4(t " 1 )/ T + 3/T) fc 

where the second inequality follows from induction, the third inequality follows from the fact that 
y/T+l> + y/T~$ < 2 - ^ for any (j> < 1. □ 

Now we give the definition of weak vertices and show that if the lazy random walks in Detector 
start from some weak vertex, then Detector will find a small non-expanding set. Let k* := ck 1 ^^ for 
some constant c to be specified later. 

Definition 3. A vertex v is weak ifI(l v W T , k*) > c\k~ S where c\ is the constant in Lemma\J] 

We have the following lemma. 

Lemma 5. If v is weak, then with probability at least 1 — fc~ 8 , Detector(G, v, k, 4>) finds a set of 
volume at most k and conductance <fi and thus returns found. 

Proof. We will condition on the event £ in Lemma[3]that occurs with probability at least 1 — fc~ 8 . Let 
p t := lyW 1 and p t be the corresponding empirical distribution of the lazy random walk of length t 
starting from v. By the fact that v is weak and Lemma[3] 

1 k* 

iYp T , k*) > (1 — — )I(p T , k*) — —— > 0.99cifc~^ (for k larger than some constant) 
T kT 

Now assume that the subroutine Detector(G, v, k, <fi) does not find a set of volume at most k and 
conductance at most (j>, that is, all the sweep sets Si(p t ) of volume at most k have conductance at least 
0, then by Lemma|H 



< c 2 + Vk*(l - ( for constant c 2 > e 3 + 3e 4 ) 

< 0.99ci/c (for £ < 1/2 and fe larger than some constant) 

which is a contradiction. □ 



4.2 The Existence of "Many" Weak Vertices 

Finally, we show that if G is far from any small set expander, then G contains a large set of weak vertices 
that has a constant fraction of volume of G. More precisely, we have the following theorem. 

Theorem 2. If G = (V, E) is e-farfrom any [k* , (j)*)- expander, then the set of weak vertices has volume 
Q(em). 
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Before we prove this theorem, we show how it can be used to prove the soundness of the tester and 
thus complete the proof of the main theorem. 

Proof of Theorem\Tj By Theorem^ we know that if G is e-far from any (k* , 0*)-expander, then with 
probability at least 5/6 (on the randomness in the sampling), the tester SSETester will sample out 
a weak vertex since it picks a vertex with probability proportional to its degree, which means that at 
each time a weak vertex is chosen with probability 0(e), and the number of samples is 0(l/e). Once 
a weak vertex is sampled out, we know that with probability at leat 1 — fc~ 8 (on the randomness in the 
random walks), the subroutine Detector will find a small non-expanding set and return found, for 
which SSETester will reject the graph. Therefore, the probability that the tester outputs reject is at 
least 2/3. This completes the proof. □ 

Now we devote to prove Theorem [2] for which we prove by contradiction. That is, we show that if 
the volume of the set of weak vertices is small, then G is not e-far from (fc*, <$>*) -expander. The proof 
is composed of the following two lemmas. The first one (Lemma |6]l shows that if there are few weak 
vertices, then there is a "good" partition of G. The second one (Lemma|7]i shows that the "good" partition 
can be used by a patch-up algorithm that modifies a small fraction of edges of G and gets a small set 
expander. This part of the proof is very similar to the corresponding part of testing expansion IKSl lbl 
ILPP11I . 

We first introduce some notations. For S C U C V, let voljj(S) and <fiu(S) be the volume and 
conductance of S in the subgraph of G induced by vertices in U . We will omit the subscript when 
U = V. Now we are ready to give our lemmas. 

Lemma 6. Given a graph G = (V, E), if the volume of set of weak vertices is at most ^|p, then there 
exists a partition ofV, written as V = A\ U • • • U A s U B, where Ai, ■ ■ ■ , A s , B are mutually disjoint, 
satisfying that vol(Ai U • • • U A s ) < ^ and for any i < s, vol(Ai) < k* and </>(A) < $ := 
Furthermore, the induced subgraph on vertex set B is a (k* , 3>)-expander. 

Proof. Let Aq be the empty set and let Vq ■— V. For each i > 1, if there is a subset Ai C Vi-i such 
that vol(A;) < k* and <\>Vi-x (Ai) < then we remove Ai from Vi~\ and let Vi := Vi_i\A;. We repeat 
this process no such subset exists. Assume that the last set we removed is A s . Then by construction, we 
know that the induced subgraph on B := V s = V\(Ai U • • • U A s ) is a (k* , $) -expander. 
Noting that 

e{A i: V\Ai) = e(Ai,Vi-.i\Ai) + e(A i ,A l U • • • U A^), 

and that 

vol(A) = volw, (Ai) + e(A,, Ai U • ■ ■ U Aj-i), 

we have 

<t>{Ai) < <t> Vi _ t (Ai) = e(A,^_ 1 \A i )/vol 1/i _ 1 (A) < 

Now by Lemma [U for each At such that 1 < i < s, there exists a large subset Ci C A such that 

vol(Ci) > vol(Aj)/2 and, for any v £ C i7 

I(l v W T ,k*) > W T (A) > ci(l - ^-f = Cl (l - ^) V > Cl fc-«, 

2 lb 

which means that v is weak. 

Now assume that vol(Ai U • • • U A s ) > ^p, then the volume of the set of weak vertices is at least 
(vol(Ci) + ■ • • + vol(C s ))/2 > which is a contradiction. This completes the proof. □ 

Now let <f>* = c'£cf) 2 for some constant d to be specified later. 
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Lemma 7. If a graph G has a partition as in Lemma\6\ then we can modify at most em edges of G to 
obtain a (k*, (/)*) -expander. 

The proof uses the following patch-up algorithm. 

Patch-Up(G) 

1 . Partition G into A\ , ■ ■ ■ , A s , B as in Lemma[6] 

2. Remove all the edges incident to some vertex in Ai for all i < s. 

3. For any i < s and each vertex u £ A i7 repeat the following process until its degree 
becomes d(u) or d(u) — 1: 

(a) Choose a vertex v £ B with probability d(y) /vol(B). 

(b) If the current degree of v is less than d(v), then add an edge between u and v. 

(c) Otherwise, if there is an edge (v, w) such that w e B, then delete edge (v, w) and 
add edges (u, v) and (u, w). 

(d) Otherwise, resample the vertex v E B. 



Proof Sketch of Lemma\7\ Li, Pan and Peng [LPP1 1 1 ensured the termination of step (01 in the algorithm 
Patch-Up in that every vertex v in Ai will reach degree d(v) or d(v) — 1, and showed that the number 
of edges modified during the whole process is at most em. Furthermore, they proved that if the vertex 
subset B in the partition is a ^-expander, then the final graph is a C3$-expander for some constant C3. 

Thus, to prove Lemma|7] we only need to show that if the subset B is a (k*, $)-expander (as guaran- 
teed by Lemma|6]l, then the final graph is also a (k* , 03$) -expander. This follows directly by applying 
the arguments in IILPP1 11 for studying the conductance of all subsets of volume at most rn to studying 
the the conductance of all subsets of volume at k*. Finally, we set d := || so that the final graph is a 
(k*, 0*)-expander. □ 

Proof of Theorem^ The proof follows directly from Lemma|6]and Lemma|7] □ 



5 Conclusions 

We use the analysis of lazy random walks from the local graph partitioning algorithms to give a 
one-sided small set expansion tester SSETester for general graphs. The tester distinguishes (k, </>)- 
expanders and graphs that are e-far from (fc*, </!>*) -expanders where k* = 0(fc 1_, ») and cf>* — 0(£,<j> 2 ) for 
any £ < 1/2. The running time of SSETester is slightly super linear in k and is sublinear in n only 
when k = O( logfi "i) - ). In particular, the tester can not be served as an expansion tester, since when 
k = m, the running time our tester is suplinear in n. It will be very interesting to design a tester with 
running time sublinear in k. A more challenging problem is to reduce the gap in both the volume and 
conductance of our tester, with the goal of finding a tester that can distinguish (k, 0)-expanders from 
graphs that are e-far from (k, 0)-expanders. 
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